PVRED: A Position-Velocity Recurrent Encoder-Decoder for Human Motion Prediction

نویسندگان

چکیده

Human motion prediction, which aims to predict future human poses given past poses, has recently seen increased interest. Many recent approaches are based on Recurrent Neural Networks (RNN) model with exponential maps. These neglect the pose velocity as well temporal relation of different and tend converge mean or fail generate natural-looking poses. We therefore propose a novel Position-Velocity Encoder-Decoder (PVRED) for makes full use velocities positional information. A position embedding method is presented RNN (PVRNN) proposed. also emphasize benefits quaternion parameterization design trainable Quaternion Transformation (QT) layer, combined robust loss function during training. provide quantitative results both short-term prediction in 0.5 seconds long-term 1 seconds. Experiments several benchmarks show that our approach considerably outperforms state-of-the-art methods. In addition, qualitative visualizations 4 could human-like meaningful very long time horizons. Code publicly available GitHub: https://github.com/hongsong-wang/PVRNN.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Recurrent Encoder-Decoder Network for Sequential Face Alignment

We propose a novel recurrent encoder-decoder network model for real-time video-based face alignment. Our proposed model predicts 2D facial point maps regularized by a regression loss, while uniquely exploiting recurrent learning at both spatial and temporal dimensions. At the spatial level, we add a feedback loop connection between the combined output response map and the input, in order to ena...

متن کامل

Semi-supervised Learning with Encoder-Decoder Recurrent Neural Networks: Experiments with Motion Capture Sequences

Recent work on sequence to sequence translation using Recurrent Neural Networks (RNNs) based on Long Short Term Memory (LSTM) architectures has shown great potential for learning useful representations of sequential data. A one-to-many encoder-decoder(s) scheme allows for a single encoder to provide representations serving multiple purposes. In our case, we present an LSTM encoder network able ...

متن کامل

RED-Net: A Recurrent Encoder-Decoder Network for Video-based Face Alignment

We propose a novel method for real-time face alignment in videos based on a recurrent encoder-decoder network model. Our proposed model predicts 2D facial point heat maps regularized by both detection and regression loss, while uniquely exploiting recurrent learning at both spatial and temporal dimensions. At the spatial level, we add a feedback loop connection between the combined output respo...

متن کامل

A study of the recurrent neural network encoder-decoder for large vocabulary speech recognition

Deep neural networks have advanced the state-of-the-art in automatic speech recognition, when combined with hidden Markov models (HMMs). Recently there has been interest in using systems based on recurrent neural networks (RNNs) to perform sequence modelling directly, without the requirement of an HMM superstructure. In this paper, we study the RNN encoder-decoder approach for large vocabulary ...

متن کامل

Position Tracking Control with Velocity from Accelerometer and Encoder

Being widely used in industrial systems and manufacturing lines, precision position control systems need to use high feedback control gains to reject disturbances. However, phase-lag in velocity estimation resulting from encoder measurement imposes a limitation on maximum allowable feedback gains, when system stability and control smoothness are concerned. In this paper, use of velocities deriv...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE transactions on image processing

سال: 2021

ISSN: ['1057-7149', '1941-0042']

DOI: https://doi.org/10.1109/tip.2021.3089380